AMENDMENTS TO THE CLAIMS

Claim 1 (currently amended): A computer-implemented method for searching a
collection of items, wherein each item in the collection has a set of properties, comprising the 
steps of: 

obtaining a query composed of a first set of one or more properties; and 

obtaining a result based on applying a distance function to the query and an item 
in the collection having a second set of one or more properties, wherein 

obtaining a result includes determining a third set of properties common to the 
first set of one or more properties and the second set of one or more properties, and 

the distance function determines a distance between the query and the item in the
collection based on a frequency of occurrence of items in the collection that have
all of the properties in the third set of properties, wherein a higher
frequency of occurrence of items in the collection having all of the properties in
the third set of properties indicates a greater distance between the query and the item in the
collection and a lower frequency of occurrence of items in the collection having
all of the properties in the third set of properties indicates a smaller distance
between the query and the item in the collection; and

providing a representation of the result to a user. 
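
For illustration only, the distance function recited in claim 1 might be sketched as follows. The sketch assumes that items and queries are plain sets of property names and that the distance is simply the fraction of items having every common property; the claim requires only that a higher frequency indicate a greater distance. The names COLLECTION, frequency, and distance are hypothetical and do not appear in the claims.

```python
from typing import Dict, Set

# Hypothetical toy collection: item name -> set of properties.
COLLECTION: Dict[str, Set[str]] = {
    "a": {"red", "wine", "france"},
    "b": {"red", "wine", "italy"},
    "c": {"white", "wine", "france"},
    "d": {"red", "shirt"},
}


def frequency(collection: Dict[str, Set[str]], props: Set[str]) -> float:
    """Fraction of items in the collection that have every property in props."""
    if not collection:
        return 0.0
    matching = sum(1 for item_props in collection.values() if props <= item_props)
    return matching / len(collection)


def distance(collection: Dict[str, Set[str]], query: Set[str], item_props: Set[str]) -> float:
    """Distance between a query and an item, based on their common properties.

    The common properties form the "third set" of claim 1. The more items
    share all of them, the less distinctive the overlap, so the distance grows.
    Using the raw fraction as the distance is an assumption of this sketch.
    """
    common = query & item_props
    return frequency(collection, common)


query = {"red", "wine", "france"}
distances = {name: distance(COLLECTION, query, props) for name, props in COLLECTION.items()}
```

In this toy collection, item "d" shares only the property "red" with the query, a property held by three of the four items, so it ends up farthest from the query.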

Claim 2 (original): The method of claim 1, further including the step of associating each 
item in the collection with a set of properties. 

Claim 3 (previously presented): The method of claim 1, wherein the step of obtaining a 
result includes identifying one or more result items whose distance from the query is within a 
first threshold. 

Claim 4 (previously presented): The method of claim 3, wherein the step of obtaining a 
result includes ranking the one or more result items according to their distance from the query. 

Claim 5 (original): The method of claim 3, wherein the threshold is defined as a number 
of result items.

Claim 6 (original): The method of claim 3, wherein the threshold is defined as a
distance.
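
As a hedged illustration of claims 3 through 6, the two kinds of threshold might be applied to precomputed distances as sketched below. The function names within_distance and nearest, and the example distances, are hypothetical; ties are broken by item name purely to keep the output deterministic.

```python
from typing import Dict, List


def within_distance(distances: Dict[str, float], max_distance: float) -> List[str]:
    """Result items whose distance is within a distance threshold (claim 6), ranked (claim 4)."""
    selected = [name for name, d in distances.items() if d <= max_distance]
    return sorted(selected, key=lambda name: (distances[name], name))


def nearest(distances: Dict[str, float], count: int) -> List[str]:
    """The count closest result items (claim 5), ranked by distance (claim 4)."""
    ranked = sorted(distances, key=lambda name: (distances[name], name))
    return ranked[:count]


# Hypothetical distances from a query to four items.
example = {"a": 0.25, "b": 0.5, "c": 0.5, "d": 0.75}
by_distance = within_distance(example, 0.5)
by_count = nearest(example, 2)
```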

Claim 7 (original): The method of claim 1, further including the step of returning the
result.

Claim 8 (original): The method of claim 1, wherein the step of obtaining a query
includes the step of mapping a received query to a set of one or more properties. 

Claim 9 (original): The method of claim 1, wherein one or more of the properties are
binary.

Claim 10 (original): The method of claim 1, wherein one or more of the properties are 
related by a partial order, and wherein, if an item is associated with a property, then the item is 
also associated with all ancestors of that property in the partial order. 
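
The ancestor rule of claim 10 can be pictured with a short sketch. It assumes the partial order is supplied as a mapping from each property to its direct parents; that representation, the function name with_ancestors, and the example properties are all hypothetical.

```python
from typing import Dict, Iterable, Set


def with_ancestors(props: Iterable[str], parents: Dict[str, Set[str]]) -> Set[str]:
    """Return props together with every ancestor reachable through parents."""
    closed: Set[str] = set()
    stack = list(props)
    while stack:
        prop = stack.pop()
        if prop not in closed:
            closed.add(prop)
            # Follow the direct parents; properties without parents end the walk.
            stack.extend(parents.get(prop, ()))
    return closed


# Hypothetical partial order: merlot < red wine < wine.
PARENTS = {"merlot": {"red wine"}, "red wine": {"wine"}}
expanded = with_ancestors({"merlot"}, PARENTS)
```

Under this sketch, an item tagged only with "merlot" would also match queries on "red wine" or "wine".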

Claim 11 (previously presented): The method of claim 10, wherein one or more of the
properties represent numerical values or ranges, and wherein the partial order reflects a set of
containment relationships among the numerical values or ranges. 

Claim 12 (original): The method of claim 1, wherein the properties are grouped into 
equivalence classes. 

Claim 13 (original): The method of claim 12, further including the step of grouping the
properties into equivalence classes using clustering. 

Claim 14 (original): The method of claim 13, wherein each property has a set of 
subproperties, wherein the clustering is performed such that the distance between two properties 
in the collection is correlated to the number of properties in the collection that are associated
with all of the subproperties common to both properties. 

Claim 15 (original): The method of claim 1, wherein the query corresponds to a single 
item in the collection.

Claim 16 (original): The method of claim 1, wherein the query corresponds to a plurality
of items in the collection. 

Claim 17 (original): The method of claim 1, wherein the query is independent of the
items in the collection. 

Claim 18 (original): The method of claim 1, wherein the step of obtaining a result is 
constrained to a subcollection of the items in the collection.

Claim 19 (original): The method of claim 18, wherein the subcollection is specified as
an expression of properties. 

Claim 20 (original): The method of claim 19, wherein the expression includes a subset 
of the set of properties that compose the query. 

Claim 21 (original): The method of claim 1, wherein the step of obtaining a query 
includes identifying certain properties to be ignored in the step of obtaining a result. 

Claim 22 (original): The method of claim 1, wherein the distance function is applied 
explicitly. 

Claim 23 (original): The method of claim 1, wherein the distance function is applied
implicitly. 

Claim 24 (original): The method of claim 23, wherein the step of obtaining a result 
includes the step of iterating a random walk process to select potential result items. 

Claim 25 (original): The method of claim 24, wherein the step of obtaining a result 
includes ranking the potential result items by frequency and selecting the potential result items 
having higher frequencies. 

Claim 26 (original): The method of claim 23, wherein the step of obtaining a result 
includes iterating through one or more subsets of the query and identifying items associated with 
the one or more subsets.

Claim 27 (original): The method of claim 26, wherein the one or more subsets are
prioritized according to the number of items in the collection that have all of the properties in 
each subset and wherein iterating through one or more subsets of the query is continued until a 
first threshold is reached. 
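
One possible reading of claims 26 and 27 is sketched below, for illustration only. It assumes that subsets of the query matched by fewer items are visited first and that the first threshold is a maximum number of results; neither detail is fixed by the claims. The function name search_by_subsets and the toy collection are hypothetical, and enumerating every subset is exponential in the size of the query, so this form suits small queries only.

```python
from itertools import combinations
from typing import Dict, List, Set


def search_by_subsets(collection: Dict[str, Set[str]], query: Set[str], max_results: int) -> List[str]:
    """Collect items by visiting query subsets, rarest subsets first."""
    if max_results <= 0:
        return []
    props = sorted(query)
    candidates = []
    for size in range(1, len(props) + 1):
        for subset in combinations(props, size):
            wanted = set(subset)
            matches = sorted(name for name, item in collection.items() if wanted <= item)
            if matches:
                candidates.append((len(matches), subset, matches))
    # Prioritize subsets by the number of items that have all their properties.
    candidates.sort()
    results: List[str] = []
    for _count, _subset, matches in candidates:
        for name in matches:
            if name not in results:
                results.append(name)
                if len(results) >= max_results:
                    return results  # threshold reached, stop iterating
    return results


# Hypothetical toy collection: item name -> set of properties.
ITEMS = {
    "a": {"red", "wine", "france"},
    "b": {"red", "wine", "italy"},
    "c": {"white", "wine", "france"},
    "d": {"red", "shirt"},
}
top_three = search_by_subsets(ITEMS, {"red", "wine", "france"}, 3)
```

Because an item is first reached through the rarest subset it matches, the items come out in order of increasing distance without the distance function ever being evaluated explicitly for each item.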

Claim 28 (original): The method of claim 1, wherein the step of obtaining a result
includes applying a Euclidean distance function.

Claim 29 (original): The method of claim 28, wherein the step of obtaining a result
includes merging a first result determined by applying the distance function and a second result 
determined by applying the Euclidean distance function. 

Claim 30 (original): The method of claim 28, wherein the step of obtaining a result 
includes determining a first result by applying either the distance function or the Euclidean
distance function and applying the other distance function to the first result. 

Claim 31 (currently amended): A computer-implemented method for analyzing two sets
of properties from a plurality of sets of properties of items in a collection being searched, each
item in the collection having a set of properties in the plurality of sets of properties, the method
comprising the steps of:

[...] the plurality of sets of properties that include all the common properties;

assessing the distance between the two sets of properties as a function of the
frequency of occurrence of sets of properties that include all the common properties, wherein a
higher frequency of occurrence of sets of properties that include all the common properties
indicates a greater distance, and a lower frequency of occurrence of sets of properties that
include all the common properties indicates a smaller distance; and

providing a representation of the distance to a user.

Claim 32 (currently amended): A computer-implemented method for analyzing the
relationship between two items in a collection of items, wherein each item in the collection is 
associated with a set of properties, comprising the steps of: 

obtaining a set of properties with which the two items are commonly associated; 

and 

determining the degree of commonality between the two items as a function of
a frequency of occurrence of items in the collection that have all
of the properties which the two items have in common, wherein a
higher frequency of occurrence of items in the collection having all of
the properties which the two items have in common indicates a
lesser degree of commonality and a lower frequency of occurrence of items in the
collection having all of the properties which the two items have in common
indicates a greater degree of commonality; and

providing a representation of the degree of commonality to a user.

Claim 33 (currently amended): A computer program product, residing on a computer
readable medium, for use in searching a collection of items, wherein each item in the collection 
is associated with a set of properties, the computer program product comprising instructions for 
causing a computer to: 

receive a query composed of one or more properties; and 
obtain a result based on applying a distance function to the query and an item in
the collection having a second set of one or more properties, 

wherein the distance function determines a third set of properties common to the
first set of one or more properties and the second set of one or more properties, and determines a
distance between the query and the item in the collection based on a frequency of
occurrence of items in the collection that have all of the properties in the
third set of properties, wherein a higher frequency of occurrence of items in the
collection having all of the properties in the third set of properties indicates a
greater distance between the query and the item in the collection and a lower frequency
of occurrence of items in the collection having all of the properties in the third
set of properties indicates a smaller distance between the query and the item in the collection;
and
provide a representation of the result to a user.

Claim 34 (original): The computer program product of claim 33, wherein the 
instructions cause the computer to obtain a result by identifying exactly the items whose distance 
from the query is within a threshold. 

Claim 35 (original): The computer program product of claim 33, wherein the 
instructions cause the computer to obtain a result by identifying approximately the items whose 
distance from the query is within a threshold according to a heuristic. 

Claim 36 (original): The computer program product of claim 35, wherein the heuristic 
permits a trade-off between the accuracy and the performance of a search. 

Claim 37 (original): The computer program product of claim 35, wherein the heuristic 
includes the use of a random walk process. 

Claim 38 (currently amended): A computer system for managing data records 
comprising: 

an information retrieval subsystem that stores and retrieves data records, each 
data record being associated with a set of properties; and 

a similarity search subsystem that receives similarity search queries and processes 
similarity search queries based on a distance function, a similarity search query being associated 
with a first set of properties, 

wherein the distance function determines a distance between the query and a data 
record in the collection having a second set of properties based on determining a third set of 
properties common to the first set of properties and the second set of properties, and determining 
the frequency of occurrence of data records in the collection that are associated with all
of the properties in the third set of properties, wherein a higher frequency of occurrence
of data records associated with all of the properties in the third set of properties indicates a
greater distance between the query and the item and a lower frequency of occurrence of
data records associated with all of the properties in the third set of properties indicates a smaller 
distance between the query and the item.

Claim 39 (original): The computer system of claim 38, further including a clustering
subsystem that employs the distance function of the similarity search subsystem to construct a 
graph. 

Claim 40 (cancelled).

Claim 41 (cancelled).